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ABSTRACT 

We explore how finite integration times or equivalently temporal binning induces 
morphological distortions to the transit lightcurve. These distortions, if uncorrected 
for, lead to the retrieval of erroneous system parameters and may even lead to some 
planetary candidates being rejected as ostensibly unphysical. We provide analytic ex- 
pressions for estimating the disturbance to the various lightcurve parameters as a 
function of the integration time. These effects are particularly crucial in light of the 
long-cadence photometry often used for discovering new exoplanets by for example 
CoRoT and the Kepler Mission (8.5 and 30 minutes). One of the dominant effects of 
long integration times is a systematic underestimation of the lightcurve derived stellar 
density, which has significant ramifications for transit surveys. We present a discussion 
of numerical integration techniques to compensate for the effects and produce expres- 
sions to quickly estimate the errors of such techniques, as a function of integration 
time and numerical resolution. This allows for an economic choice of resolution before 
attempting fits of long-cadence lightcurves. 
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1 INTRODUCTION 

Transiting extrasolar plane ts have become a powerfu l tool 
since the fir s t disc overy by ICharbonneau et al.l l|2000l ) and 
iHenrv at al.1 (|200Ct l. The detection of a transit allows for a 
measurement of the planet-star ratio-of-radii, which may be 
used to infer the planetary radius once the stellar radius 
has been determined. Multi-wavelength measurements may 
be used to retrieve a spectrum of the light being absorbed 
by the planetary atmosphere and thus a deduction of the 
molecular constituents (e.g. Tinetti et al. 2007). Timing of 
the eclipse minima and duration permits for the detection 
of other bodies in the system (Agol et al. 2005; Holman 
et al. 2005) and companion moons (Kipping 2009a,b). In 
recent years, various authors have published increasingly in- 
genious methods of characterizing exoplanets through tran- 
sits, including thermal emission detections (Deming et al. 
2006), spin-orbit alignment (Winn et al. 2005), oblateness 
measurements (Seager & Hui 2002; Carter et al. 2010) and 
many more. 
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As we look for these small whispers of the exoplanet's 
nature, we are in fact examining the morphology of the tran- 
sit lightcurve in increasing detail. It is therefore paramount 
that the methods for modeling these lightcurves are in the 
most precise form possible. 

In this paper, we explore the effects of finite integra- 
tion time on the transit lightcurve, or equivalently the act 
of temporal binning. Long-cadence (LC) data smears out 
the transit lightcurve signal into a broader shape which will 
lead to an erroneous retrieval of the system parameters, if 
unaccounted for. 

We will first outline the nature of the morphological dis- 
tortions in §2 and then derive analytic expressions for the 
approximate systematic error in the retrieved parameters, as 
a function of integration time. It is shown that the effect can 
be significant enough to even lead to some planetary can- 
didates being rejected as being unphysical, unless the effect 
is correctly accounted for. Obviously, this is critical for mis- 
sions like CoRoT and Kepler which employ LC photometry 
to detect new transiting planets. In §3, we will discuss nu- 
merical techniques to correctly model the transit lightcurve 
and derive the errors of these methods as a function of in- 
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tegration time and numerical resolution. These expressions 
allow for a simple evaluation of the expected consequences 
of finite integration time and thus will guide observers in 
choosing how to deal with this issue. 



transit depth, 5. These parameters may then be used to 
deter mine orbital incl ination, i, and stellar density, p*. Re- 
cently, iKippin j (I2OI0I ) extended these equations to account 
for orbital eccentricity and we will here employ these more 
general expressions in our analysis: 



2 THE EFFECTS OF FINITE INTEGRATION 
TIME 

2.1 Ingress/Egress Durations 

For a transit lightcurve, there are four critical contact points 
which define the overall shape, which represent the points 
where the time derivative is discontinuous. Physically speak- 
ing, contact points I and IV occur when the sky-projected 
planet-star separation is equal to the stellar radius plus to 
planetary radius. Contact points II and III occur when this 
parameter equals the stellar radius minus the planetary ra- 
dius. Defining W as the average of the durations between the 
l=t_to-4* and 2°'*-to-3'''* contacts (Kipping et al. 2010), ic 
as the mid-transit time and r as the ingress/egress duration, 
we have: 
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We here derive what these retrieved parameters would 
be if the effect of finite integration time was ignored. In the 
above expressions, we replace tr &i tp with t'rp — {tr -I- 1) & 
t'p = (tp—T). The new expressions then have the remaining 
tr &i tp terms written out in terms of the true values of b, 
an, etc. This leaves us with b'{b, ur, ...), a'ii{b, an, ...), etc: 



ti = tc- W/2 - r/2 
tii = tc- W/2 + r/2 
tin = tc + W/2 - r/2 
tiv = U + W/2 + T/2 



(1) 
(2) 

(3) 
(4) 



The principal effect of finite integration time is to smear 
out the lightcurve into a broader shape (see figure 1). The 
apparent ingress and egress durations will increase and addi- 
tional curvature will be introduced into the lightcurve wings. 
The ingress/egress stretching can be considered in terms of 
the apparent positions of the contact points being tempo- 
rally shifted from their true value. The magnitude of this 
time shift is dependent on the relative phase difference be- 
tween the sampling and the transit signal. If we assume that 
a large number of transits observed with LC photometry are 
folded about the orbital period, as is typical in transit de- 
tection, then the effect becomes much more predictable with 
the deviation averaging out to 1/2. 

Under these conditions, contact points I and IV will ap- 
pear to move outwards from tc by one half of the integration 
time each, T/2. Conversely, contact points II and III will ap- 
pear to move inwards by the same amount. Let us define the 
apparent contact points as t': 



t'l = ti- T/2 
t'n = tu + 1/2 
t'lii = tin — 1./2 
t'lv = tiv + T/2 



(5) 
(6) 
(7) 
(8) 

ISeager fc Mallen-Ornela^ (|2003t ) showed that several 
physical system parameters may be retrieved by anal- 
ysis of the lightcurve morphology. Specifically, they de- 
fine tr = tiv — ti and tp = tm — tu. Let us de- 
fine th e apparent observed valu e s as t'p and t'p respec- 
tively. ISeager fc Mallen-Ornel3 (|2003l ) showed that the 
semi-major axis of the planetary orbit divided by the stellar 
radius, an, the transit impact parameter, 6, and the ratio- 
of-radii, p, are all determined by manipulation oitp, tp and 
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Setting T = Q returns the original results as expected. 
Unfortunately, these equations are somewhat overly complex 
for us to draw any physical intuition. To proceed, let's con- 
sider a typical case example by using the system parameters 
from one of the Kepler planets, since these are discovered 
using long-cadence data. The following example is for the 
assumption of zero limb darkening, which is a very poor one 
for the Kepler bandpass. The effects of limb darkening will 
be discussed later. 

In figure 2, we plot the retrieved stellar density, p', , as 
a function of the true stellar density, p« . We fix all other pa- 
ramet ers to be that of Kepler-5b, as reported bv lKoch et al.1 
l|2010h . The effect can be seen to be highly significant, caus- 
ing the retrieved stellar density to be underestimated by a 
factor which borders on an order-of-magnitude. This scale of 
underestimation is sufficient to completely reject some plan- 
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Figure 1. A trapezoid approximated lightcurve with a one hour flat-bottom duration and 30 minute ingress/egress duration, t, is shown 
in solid. The dashed line shows the lightcurve morphology for an integration time of 30 minutes. The apparent ingress/ egress duration, 
t' , can be seen to have doubled purely as a consequence of the integration time. 



etary candidates as unphysical. However, we note that in 
reality the underestimation of p* will not be this severe due 
to countering effects of limb darkening suppression discussed 
later in §2.2. 

Looking at figure 2 again, let us explore the phys- 
ical reasons for this underestimation. We begin our line 
of thought by considering the impact parameter, b. If the 
ingress/egress duration is elongated, what do we expect to 
happen to the derived impact parameter? Consider two ex- 
treme cases. When a planet transits with a very low impact 
parameter, we have essentially an equatorial transit. This 
means the vector describing the sky-projected planetary ve- 
locity is nearly perpendicular to the stellar limb. As a result, 
the planet crosses the limb expediently. In contrast, if we 
have a near-grazing transit, the mutual angle between the 
stellar limb and the sky-projected planetary velocity vector 
has become more acute, which has the effect of making the 
limb-crossing time much longer. Therefore, stretching the 
ingress/egress duration causes h' > b. 

Having established this point, consider the effect on an. 
The simplest way to understand the effect on this param- 
eter is to appreciate that b and an exhibit an extremely 
strong negative correlation, as demonstrated bv lCarter et al.l 
lj2008| ). So the act of increasing the ingress/egress duration 
will increase b and therefore decrease ur. Finally, from equa- 
tion (11) we know that p* oc a|j. 



2.2 Limb Darkening Effects 

For transit observations at visible wavelengths, limb darken- 
ing is quite pronounced producing a well-known curvature 
in the flat-bottom part of the lightcurve. For large integra- 
tion times, the curvature is smeared out, producing a flatter 
transit trough morphology. 

If we were trying to fit such a transit lightcurve, and the 
limb darkening parameters are exactly known, our hypothet- 
ical fitting algorithm would move towards a case where limb- 
darkening is least effective. This occurs for near-equatorial 
transits. This is because as the transit impact parameter 
becomes larger and larger, we move closer and closer to the 
limb of the star where limb darkening accentuates. There- 
fore, LC data pushes b' towards a more equatorial value due 
to limb darkening i.e. b' < b. 



2.3 Two Countering Effects 

In conclusion, finite integration times have two countering 
effects on b' and thus a'n and pi as well. The ingress and 
egress smearing causes the ingress to appear larger, which 
occurs for more grazing transits. In contrast, the limb dark- 
ening is suppressed, which occurs for more equatorial tran- 
sits. Whilst limb darkening is important, especially for tran- 
sit surveys like CoRoT and Kepler which operate at visible 
wavelengths, the fundamental change in the transit morphol- 
ogy is sufficiently large that it will tend to dominate. This 
is because the amplitude of the curvature in the lightcurve 
trough is usually at least an order-of-magnitude less than 
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Figure 2. As an example, we use the system parameters of Kepler-5b to show the effect on the retrieved stellar density as a result of 
long integration times, in the case of no limb darkening. The 1 min line appears to produce results within the typical uncertainties of 
the derived stellar density. 



the amplitude of the transit signal itself. Therefore, the con- 
sequences of smearing the overall transit signal will tend to 
dominate over the suppression of limb darkening. 

The general rule of thumb that b' is overestimated will 
break-down when we have r ^ I but tp ~ which 
occurs for near-grazing transits. In such a case, the fractional 
change to the ingress durations is minimized but the change 
in the transit trough curvature is maximized. 

To exactly calculate the net consequence of these two 
effects though, the integration time should be included when 
we generate our model lightcurves rather than attempting 
ad-hoc corrections post-analysis. This appears to be the only 
way to completely account for the effect in a reliable manner. 

2.4 Consequences for the Transit Depth 

We will briefly comment on the effect of finite integration 
time on the transit depth. Assuming no limb darkening is 
present and tp > X, the transit depth is completely un- 
affected by the long integration time. This has important 
consequences for secondary eclipses where the lightcurve is 
unaffacted by stellar limb darkening. 

For cases where we have limb darkening, the net ef- 
fect on the retrieved p' will depend on whether b' > b or 
b < b' . Additionally, the effect will be a function of what 
assumptions were used in the fitting algorithm (e.g. the fix- 
ing of various parameters), the true impact parameter and 
the limb darkening coefficients. Given the large number of 
correlated factors, predicting the effect of integration time 
becomes less reliable and we must fit the transit lightcurve 



with a model which accounts for integration time in the first 
place. 

2.5 Observed Effects with Kepler 

We point out that the effects of long integration times have 
already been observe d by the Kepler Mission. Figure 4 of 
iGilliland et"all (|201(]| '). shows the transit lightcurve of long- 
cadence and short-cadence data for the same planet, TreS- 
2b. The long-cadence lightcurve exhibits a broader shape 
with the apparent position of the contact points shifted by 
~ X/2, as predicted by our model. Notice also that the cur- 
vature in the transit trough, due to limb darkening, has also 
been attenuated. 



3 ACCURATE TRANSIT LIGHTCURVE 
MODELING 

3.1 Analytic Integration 

The critical problem we have outlined can be simply sum- 
marized by the following: Don't fit an unbinned model to 
binned data. The model u sually used to g e nerate a transit 
lightcurve is provided bv iMandel fc Agoll (|2002l ) (MA02), 
which includes the effects of stellar limb darkening. To gen- 
erate the MA02 lightcurve, we usually have a set of time 
stamps forming a time vector t. This time vector repre- 
sents instantaneous moments rather than integrated time. 
The t vector is converted into a vector of instantaneous true 
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anomalies, /, by solving Kepler's equation numerically. / is 
converted to a z array, where z{f) is the instantaneous sky- 
projected planet-star separation. Finally, the MA02 equa- 
tions provide us with F_, where F is the instantaneous flux. 
We summarize the sequence of events as: 



m 



Now that we have established the mechanism of gener- 
ating of a transit lightcurve for instantaneous time stamps, 
_F(t), let us consider what the transit lightcurve for inte- 
grated time stamps would be, F_(t}- In this case, the inte- 
grated flux would be given by: 



£(i) = 



-1/2 ' 
t+I/2 



t=t-I/2 



dt 



(15) 



This equation suffers from the problem that F_ cannot 
be written as a function of t analytically, since such a solu- 
tion would require a closed-form solution to Kepler's equa- 
tion, which is transcendental. Evaluating this expression for 
_F as a function of z is also not possible since we would find 
the following: 



i+X/2 



F{t)dt -- 



t=t-I/2 



F{t{z))^iz)dz (16) 

z(i-X/2) 1^ 



Whilst dz/dt may be evaluated analytically through a 
chain rule expansion of (dz/df) x (d//dt), the resultant ex- 
pression will be as a function of /, rather than z. If we knew 
f{z), then we would be able to write out the integrand in a 
closed-form, but f{z) can only be f o und b y solving a quar- 
tic equation, as shown by Kipping (2008) (K08). Unfortu- 
nately, as discussed in iKippine (2010), there is no currently 
proposed method to correspond which roots refer to which 
orbital conjunction which makes a closed-form expression 
elusive. 

The only remaining hope for a simple analytic expres- 
sion would be to express the integral in terms of true or 
eccentric anomaly, which are inter-changeable. This would 
yield the following integral: 



t+X/2 /./(t+I/2) n, 

F{t)dt^ F{t{f))^if)df (17) 

t=t-X/2 Jf(i-X/2) CI,/ 

The integrand of this expression may be written out in 
a closed-form, by utilizing the solutions of K08: 



t+I/2 



t=t-X/2 



F{t)dt^ / F{z{f)) — ^==D{f)df 

i/(f-i/2) 27r ^1 - e2 

(18) 

Where D{f) is the duration function defined b y K08, 
F(z) is given by MA02 and z{f) is well-known (e.g. IWinnI 
l|2010l )). The integral limits do not possess a closed-form 
solution since once again we must solve Kepler's equation, 
but in principle the indefinite integral could be analytically 
evaluated and then the relevant limits applied after a sub- 
routine provides numerical solutions to Kepler's equation. 
We believe that this strategy would be the most computa- 
tionally efficient since we have obviated the need for any 



numerical integration. However, we were unable to find a 
solution for the indefinite integral for even a uniform-source 
case and will therefore focus the remainder of this discussion 
onto numerical techniques. 

3.2 Numerical Integration 

Having established the significant challenges regarding ana- 
lytic integration, we now turn out attention to the the use 
of numerical integration techniques. The functions we need 
to integrate over are in fact very well-behaved and well- 
approximated by compositions of polynomials and thus we 
anticipate even a low-resolution numerical integration tech- 
nique should provide satisfactory accuracy. 

In this subsection, we first consider the merit of Simp- 
son's Rule or other Newton-Cotes based methods. We aim 
to avoid using nested quadrature methods like the Gauss- 
Kronrod or Clenshaw-Curtis, as the number of integrations 
required is large and we wish to avoid nested methods. For 
the simplest case of Simpson's rule, we have: 

p^^^^,^^FiU-l/2)+4FiU)+FiU+l/2) ^^^^ 

Where A'^ denotes the number of calls needed to the 
MA02 code and essentially is a measure of the resolution of 
our numerical integration. This method may be extended to 
higher intervals by using Simpson's composite rule. Alter- 
natively, we can extend to cubic, quartic, etc interpolations 
by using the Newton-Cotes formulas. Each time we evaluate 
F{t) requires another call to the MA02 subroutine, and thus 
we wish to minimize the number of calls, but maximize the 
accuracy of the employed technique. 

Simpson's composite rule works by splitting up our in- 
tegration range into 2m subintervals, therefore requiring 
N = 2m + 1 calls to the MA02 code. The error on the 
composite Simpson's rule is given by: 



Comp. Simpson 
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Where e is some number between ti — X/2 and ti + 
1/2. In contrast, the Newton-Cotes formulas move through 
increasing orders by increasing the interpolation order. For 
the TV = 4 case (which is the cubic interpolation scenario, 
known as Simpson's 3/8 rule), the equivalent errors between 
the two methods are: 



Newton—- Cote 
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(22) 



Comp. Simpson 

Thus for N — 4, Simpson's composite rule offers greater 
accuracy than the Newton-Cotes based equation. Moving 
through the higher orders in the Newton-Cotes family causes 
the error to have a functional dependence on F^'^\ i.e. the 
A^"^ differential of F. So for AT > 4 it is not possible to 
give an exact comparison between the two methods since 
F^^^ (t) is not known for any A'. Therefore, our only reliable 
comparison is for the A^ = 4 case, from which we conclude 
the composite Simpson's rule is superior in terms of accuracy 
versus computational requirement. 
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3.3 Error in Numerical Integration 

Let us now consider what value of A'^ should be used. There 
are essentially three segments of the lightcurve which exhibit 
curvature and thus would produce the maximum error in 
our numerical integrations, which employ linear piece-wise 
approximations. 

(i) Curvature of the ingress/egress 

(ii) Curvature of the limb-darkened lightcurve trough 

(iii) Discontinuities at the contact points 

The last of these is due to a discontinuous function and 
the former two are due to curvatures within continuous func- 
tions. We will treat these two different sources of 'curvature' 
separately, although from the arguments made earlier, we 
expect the last of these effects to be the largest source of 
numerical error. 



3.3.1 Ingress/ egress curvature 

The transit lightcurve has a depth 5 and an ingress duration 
r. For most of the ingress, the curvature is close to zero and 
essentially mimics a linear slope. Ifowever, near the contact 
points, the slope rapidly changes to a flat line of zero gra- 
dient. Therefore, near the contact points, the ingress/egress 
morphology causes large amounts of curvature. These points 
will exhibit the largest numerical errors in using a technique 
like Simpson's composite rule. 

A suitable choice of resolution can be made by increas- 
ing N until (T^lmax ^ ^-F^obs' i'^' ^ur calculation should pro- 
duce a flux which has a maximum systematic error which 
is less than the observational uncertainty. We will set our 
resolution to a point where it provides satisfactory accuracy 
even at the point of highest numerical error, i.e. within the 
ingress/egress near the contact points. 

Another approach would be to use an adaptive compos- 
ite Simpson's rule, for example like that proposed by McK- 
eeman (i962). However, our preference here is to avoid using 
adaptive routines since they would require a new adaption 
for every single data point and fitting trial, which would 
be time consuming. The costs versus benefits of using such 
a method could warrant further investigation in the future. 
Instead, we choose to use the adaption required for the most 
troublesome points, which we have already identified. The 
required interval size in each element of the Simpson's com- 
position should be decreased until we reach: 
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In order to continue, we need to evaluate F{t) in a 
closed-form, which cannot be achieved due to Kepler's equa- 
tion. However, there exists a special case where Kepler's 
equation does yield an exact closed-form solution and this 
occurs for circular orbits since M = E = f. In such a case, 
we may write: 



z(t) 
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The ingress/egress morphology is dominated by the ex- 
pressions pertaining to a uniform source. Limb darkening 
does affect the ingress/egress curvature but this is much 
less than the amplitude of the uniform source transit sig- 
nal. In the small-planet limit, MA02 provided the following 
approximation for the ingress/egress flux: 



F{x) + 



p arccos 



[?1 



(27) 



Where we have defined z = 1 + x and it understood 
that —p < X < p for the ingress/egress. For the purposes 
of the evaluating the maximum error, we know that x ~ p 
and thus we may expand the cosine term into second order 
using a Taylor series. Let us assume we have the simple case 
of fe = which means that i — tt/Q. We make further small- 
angle approximations to simplify the resultant expression 
for the error, which is justified since 2nti <^ P. The other 
adjustment we need to account for is that so we have ap- 
proximated b = and e = 0. To generalize the result, we 
consider that the effect of 6 > and e > is to stretch or 
shrink the ingress/egress duration by a factor t/tq. There- 
fore our expressions here are actually for To, which may be 
written as To — I(ro/r). We may now rewrite equation (24) 
as; 



4v/2mp - V' - 6v/4mp - + ^/Smp - V')] | (28) 



Where we have used: 



^ _ 2naR TQ^ 



To 



(29) 
(30) 



Where S{a, (3) is Simpson's rule evaluated over the in- 
terval a to p. In our case, the integral is over time and a = ti 
and b = ti-\- (To /m) , where 2m is the number of subintervals 
we split the integral into and 2m = A'^ — I where A*' is the 
required factor by which the number of calls to the MA02 
code increases by. The reason for the subscript of by the 
T term will be explained shortly. Our requirement may be 
written as: 



Due to the approximations made, we find that this 
equation is only stable for m 2. For any given data set, 
we simply need to solve equation (28) for m with some 
sensible estimates of p, b, e, uj, an and P. As an exam- 
ple, for Kepler-5b , taking the quoted parameters from the 
iKoch et al.l (|2010h paper, we find that even using m — 2 
provides an error of O.lppm, which is well below the typical 
measurement uncertainty of i30ppm. 
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3.3.2 Limb- darkened trough- curvature 



3.4 Resampling 



Another part of the lightcurve where we have significant cur- 
vature, and thus expect the maximum numerical integration 
errors, is the limb-darkened lightcurve trough. However, the 
peak-to-peak size of the changes in flux induced by the limb 
darkening are much lower than the transit signal itself (i.e. 
5); typically an order-of-magnitude. Further, the time scale 
over which these changes act is greater than that of the 
ingress/egress curvatures (i.e. tp ^ t) except for grazing 
transits. So we can see that, in general, the errors in our 
numerical integration techniques will be dominated by the 
ingress/egress curvatures rather than the limb-darkening- 
induced lightcurve-trough curvatures. 



3.3.3 Contact point discontinuities 

The final source of variation in the lightcurve gradient is that 
of the discontinuous change located at the contact points. 
Estimating the error due to this discontinuity is most easily 
estimated by assuming a trapezoid approximated lightcurve 
and considering the location where maximal error is induced. 
The largest error (and in fact only error) will occur for 
measurements close to contact points, or more specifically 
\ti — tjvfl < 1/2 where tm is the time of one of the contact 
points. 

Before the first contact point, we have a flat line at 
F — 1 and after this point we have a linear slope with a 
gradient —{5/t). The error in Simpson's composite rule will 
depend upon the relative phasing between the centre of the 
integration and the contact point, i.e. {ti — ti). Generalized 
to any phase, the true integrated flux of the trapezoid ap- 
proximated lightcurve for the z**^ time stamp is given by: 



Ft 



_ 5 (2t,+XY 



(31) 



For each value of m = 1,2,3... we choose to set the 
phase to be such that the difference between the true inte- 
grated flux and that from Simpson's method is maximized. 
Under such a condition, it may be shown that the maximum 
error is given by: 



5 X 



Comp.Simp. 



T&(N - 1) 



(32) 
(33) 



For the system parameters of Kepler-5b, we find that 
using m = 1, 2, 3 induces a maximal error of 371ppm, 93ppm 
and 41ppm respectively. Given that the measurement uncer- 
tainties are 130ppm (Koch et al. 2010), a suitable choice for 
the resolution would be m = 2 since this means the max- 
imum possible error of a data point in the least-favourable 
phasing would be below the measurement error. 

It is interesting to see that for m = 2 the error was 
O.lppm for the ingress/egress curvature of the same sys- 
tem, suggesting the discontinuity error dominates the error 
budget. Actually, this is expected from the arguments made 
earlier in this paper. Therefore, in most applications, a selec- 
tion for m based on the error induced by the contact point 
discontinuities will provide a robust integration resolution. 



An additional method for numerically integrating the 
lightcurve is discussed here. Let us consider that we have 
observations with integrated time stamps given by the vec- 
tor A second way of calculating -F(t) is to resample the 
time vector into a very fine cadence, at which point we may 
assume F — F. Let us define our temporary resampled time 
vector as As an example, for the Kepler data, we may 
choose to resample the 30 minute integrations into 1 minute 
integrations would be done by expanding each time stamp, 
ti into a sub-vector of 30 equally spaced time stamps with 
a mean value given by ti. Our new temporary time array is 
used to generate a lightcurve using the normal MA02 expres- 
sions giving us F^{t^ (note that F here has no tilde because 
the MA02 equations can only generate instantaneous flux, 
not integrated flux). We then rebin the model lightcurve 
back to the original cadence to give F(t_). Finally, we make 
the assumption F_{t^) ~ F_{t), i.e. the high cadence resam- 
pled time vector yields a lightcurve model consistent with a 
time vector of infinite cadence. 



resample MA02 



(34) 



It can be seen that resampling into TV sub-time stamps 
will increase the computation time by a factor of ~ A*', 
since typically the MA02 subroutine uses the majority of a 
lightcurve fitting algorithm's resources (especially for non- 
linear limb darkening). In the next subsection, we will show 
that the computation times can be decreased by selective 
resampling. 

One advantage of resampling is that we can choose to 
resample in such a way as to account for read-out and dead- 
times, which may be important if the instrument's duty cycle 
is quite pooiQ. The resampling of the i*'^ time stamp into A'^ 
sub-time stamps with labels j = 1,2, ...A — 1,A'' can be 
expressed as: 



N + l \X 
2 ) N 



(35) 



The fiux of the z' ^ time stamp is found by rebinning all 
TV fiux stamps from j = 1 to j = TV. 



F ■ 

TV 



(36) 



Thus for the first few values of A = 2, A = 3 and A = 4 
we would have: 



( A = 2) = i [Fit, - X/4) + F{t,+ 1/4)] (37) 

F{N^3.) = ^[Fit, ^X/3) + F{t,) + F{t, (38) 
F,{N^4) = j [F(t, - 3X/8) -I- F{t, - X/8) 

-f F{t, + X/8) + F{t, + 3X/8)] (39) 



^ We note that this is not the case for Kepler which has a duty 
cycle of 91.4%. 
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For a trapezoid approximated lightcurve, it can be eas- 
ily shown that the error in these expressions, as a function 
of A'^, is given by: 



op 

Resampling 



5 X 



(40) 



Therefore, the resampling method yields greater 
accuracy than t he co mposite Simpson's method. In 
iKipping fc Bakoj l|2010l ). both the resampling and Simp- 
son's composite rule were employed in completely inde- 
pendent analyses and the obtained results were consistent. 
Therefore, whilst we are free to use either method discussed 
here, the most efficient approach out of the two is resam- 
pling 



Icilliland et al.l (|2010t ) reported that they used a method 
for fitting the long-cadence lightcurve of TreS-2b which 
we interpret to be effectively equivalent to the resampling 
method. The authors split the LC intervals into 30 con- 
tributing sub-intervals corresponding to the SC cadence i.e. 
A'^ = 30. For the reported LC r.m.s. noise of 66ppm and 
the sy stem parameters of TreS-2b taken from I Winn et al.l 
l|2008h , we estimate that using N — 5 would produce a max- 
imum possible error in the most unfavourably phased data 
point of 59ppm and thus using TV = 30 is excessive for this 
lightcurve. Our equations therefore permit for a reduction 
in computational time of 600%. 



3.5 Selective Resampling 

Resampling time stamps which satisfy \ti—tc\ > [tr +I)/2 
and — sc| > (st+I)/2 is unnecessary since Fi = Fi — I'm 
such cases (assuming we have folded multiple transits about 
the orbital period). Note that we define sc as the mid-time 
of the secondary eclipse and st as the duration from contact 
point I to IV of the secondary eclipse. We label this method 
of optimization as selective resampling. 

Since (tr + st) / P ~ 2/(7ra_R), this can reduce the num- 
ber of time stamps which require resampling by an order of 
magnitude for continuous staring telescopes like Kepler and 
CoRoT. It should be noted that selective resampling will not 
be possible if the lightcurve model includes phase variations 
of the planet e.g. HAT-P-7b. ,Borucki et al., (|200a 'l. 



4 CONCLUSIONS 

We have explored how long-cadence data, with particular 
focus on Kepler, causes severe systematic errors in the re- 
trieved physical parameters, unless accounted for. The effect 
is valid for any finite exposure time but increases with longer 
cadences. Long-cadence data smears out the lightcurve mor- 
phology, which acts to stretch out the ingress/egress dura- 
tion and suppress limb darkening in the lightcurve trough. 
These two effects act to increase and decrease the retrieved 
impact parameter respectively. Critically, overestimating the 
impact parameter is shown to lead to severe underestima- 
tions of the stellar density which could lead to planetary 
candidates being rejected on the basis of being unphysical. 

Numerical integration techniques permit for improved 
modeling of the transit lightcurve. We discuss two particular 
methods, the composite Simpson's method and resampling. 



We provide expressions for estimating the errors of these 
techniques and find that both methods produce an error 
which scales as A'^"^ where A^ is the numerical resolution 
of the techniques. Out of these two discussed methods, the 
resampling approach yields a greater efficiency. 
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